Dakuo Wang

Northeastern University, USA

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Jan 28, 2026

Agentic Conversational Search with Contextualized Reasoning via Reinforcement Learning

Jan 19, 2026

Rethinking the Value of Multi-Agent Workflow: A Strong Single Agent Baseline

Jan 18, 2026

See, Think, Act: Online Shopper Behavior Simulation with VLM Agents

Oct 22, 2025

DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans

Oct 16, 2025

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping

Oct 08, 2025

SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis

Aug 21, 2025

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation

Jul 28, 2025

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Jul 23, 2025

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Jun 05, 2025